From Acoustics to Articulation

Author

  • G. ANANTHAKRISHNAN
Abstract

The focus of this thesis is the relationship between the articulation of speech and the acoustics of produced speech. Several problems are encountered in understanding this relationship, given the non-linearity, variance and non-uniqueness in the mapping, as well as the differences that exist in the size and shape of the articulators, and consequently the acoustics, for different speakers. The thesis mainly covers four topics pertaining to the articulation and acoustics of speech.

The first part of the thesis deals with variations among different speakers in the articulation of phonemes. While the speakers differ physically in the shape of their articulators and vocal tracts, the study tries to extract articulation strategies that are common to different speakers. Using multi-way linear analysis methods, the study extracts articulatory parameters which can be used to estimate unknown articulations of phonemes made by one speaker, given other articulations made by the same speaker and the same (unknown) articulations as produced by other speakers of the language. At the same time, a novel method to select the number of articulatory model parameters, as well as the articulations that are representative of a speaker's articulatory repertoire, is suggested.

The second part is devoted to the study of uncertainty in the acoustic-to-articulatory mapping, specifically non-uniqueness in the mapping. Several studies in the past have shown that human beings are capable of producing a given phoneme using non-unique articulatory configurations when the articulators are constrained. This was also demonstrated by synthesizing sounds using theoretical articulatory models. The studies in this part of the thesis investigate the existence of non-uniqueness in unconstrained read speech. This is carried out using a database of acoustic signals recorded synchronously along with the positions of electromagnetic coils placed on selected points on the lips, jaws, tongue and velum. This part is thus largely devoted to describing techniques that can be used to study non-uniqueness in the statistical sense, using such a database. The results indicate that the acoustic vectors corresponding to some frames in all the phonemes in the database can be mapped onto non-unique articulatory distributions. The predictability of these non-unique frames is investigated, along with whether applying continuity constraints can resolve this non-uniqueness.

The third part proposes several novel methods of looking at acoustic-articulatory relationships in the context of acoustic-to-articulatory inversion. The proposed methods include explicit modeling of non-uniqueness using cross-modal Gaussian mixture modeling, as well as modeling the mapping as local regressions. Another innovative approach to the mapping problem is described in the form of relating articulatory and acoustic gestures. Definitions and methods to obtain such gestures are presented, along with an analysis of the gestures for different phoneme types. The relationship between the acoustic and articulatory gestures is also outlined. A method to conduct acoustic-to-articulatory inverse mapping is also suggested, along with

Similar resources

Improving on Hidden Markov Models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

The goal of the proposed research is to test a statistical model of speech recognition that incorporates the knowledge that speech is produced by relatively slow motions of the tongue, lips, and other speech articulators. This model is called Maximum Likelihood Continuity Mapping (Malcom). Many speech researchers believe that by using constraints imposed by articulator motions, we can improve o...

Validating auralizations by using articulation indexes

In this paper, computer modeling auralizations are validated by using articulation indexes. As is well known, one of the main challenges in validating an auralization is how to find an objective metric to evaluate its quality. The generation of acoustical virtual reality is done by the proprietary computer code RAIOS (Room Acoustics Integrated and Optimized Software), which includes sets of...

The Oro-pharyngeal articulation of nasal vowels: Problems and perspectives

This paper outlines the prospectus for an instrumentally-based approach to the articulation of nasal vowels. Complexity in the acoustics of nasal vowels has long been acknowledged but complexity in their articulation has received less attention. A growing body of research suggests that velopharyngeal opening (VPO) is complemented by other articulatory gestures which may enhance or counteract the...

Cambridge University Press 978-1-107-01834-1 - Second Language Speech: Theory and Practice

acoustic analysis consonants. See entries ‘acoustics’ under approximants/fricatives/laterals/nasals/ rhotics/stops measurement. See under Praat preparing data for analysis, 123–124 spectrograms and waveforms, 136, 137 vowels. See vowels: acoustics allophones acquiring allophonic distributions, 202, 213 acquiring TL phonemes that are L1 allophones, 148, 217, 241–244 learning a new allophonic dis...

Publication date: 2011